Reinforcement theory

Results: 290



#Item
241Stochastic control / Control theory / Partially observable Markov decision process / Markov decision process / Automated planning and scheduling / Reinforcement learning / Monte Carlo POMDP / Statistics / Dynamic programming / Markov processes

Point-based value iteration: An anytime algorithm for POMDPs Joelle Pineau, Geoff Gordon and Sebastian Thrun Carnegie Mellon University Robotics Institute 5000 Forbes Avenue Pittsburgh, PA 15213

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-06-04 12:29:32
242Artificial intelligence / Outcome / Pareto efficiency / Nash equilibrium / Mathematical optimization / Agent-based model / Welfare economics / Reinforcement learning / Subgame / Game theory / Problem solving / Science

Agendas for Multi-Agent Learning Geoffrey J. Gordon December 2006 CMU-ML[removed]School of Computer Science Carnegie Mellon University

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2006-12-20 14:23:06
243Knowledge / Academia / Artificial intelligence / Year of birth missing / Multi-agent systems / Reinforcement learning / Agent-based model / Nash equilibrium / Carnegie Mellon School of Computer Science / Science / Game theory / Formal sciences

Multiagent Learning in the Presence of Agents with Limitations Michael Bowling May 14, 2003 CMU-CS[removed]

Add to Reading List

Source URL: reports-archive.adm.cs.cmu.edu

Language: English - Date: 2003-07-21 09:06:25
244Stochastic control / Bayesian statistics / Artificial intelligence / Partially observable Markov decision process / Bayesian game / Action selection / Reinforcement learning / Markov decision process / Decision theory / Statistics / Dynamic programming / Markov processes

Approximate Solutions For Partially Observable Stochastic Games with Common Payoffs Rosemary Emery-Montemerlo, Geoff Gordon, Jeff Schneider School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2004-06-25 15:30:50
245Stochastic control / Control theory / Partially observable Markov decision process / Reinforcement learning / Markov decision process / Automated planning and scheduling / Action selection / Bellman equation / Abstraction / Statistics / Dynamic programming / Markov processes

Policy-contingent abstraction for robust robot control Joelle Pineau, Geoff Gordon and Sebastian Thrun School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-06-04 12:29:33
246Loss function / Reinforcement learning / Forcing / Price of anarchy / Game theory / Statistics / Mechanism design

No-Regret Learning and a Mechanism for Distributed Multiagent Planning Jan-P. Calliess Geoffrey J. Gordon

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2008-02-18 10:33:25
247International Conference on Machine Learning / Reinforcement learning / Informatics Forum / Computational learning theory / Scottish Informatics and Computer Science Alliance / Artificial intelligence / Machine learning / Learning

ICML 2012 Handbook International Conference on Machine Learning June 26 - July 1, 2012 Edinburgh, Scotland, UK

Add to Reading List

Source URL: icml.cc

Language: English - Date: 2012-06-14 13:31:31
248Decision theory / Markov models / Mathematical optimization / Dynamic programming / Reinforcement learning / Markov decision process / Markov chain / Value of information / Variance / Statistics / Probability and statistics / Markov processes

Selecting Computations: Theory and Applications Nicholas Hay and Stuart Russell Computer Science Division University of California Berkeley, CA 94720

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2012-10-04 09:08:48
249Mathematical optimization / Dynamic programming / Equations / Operations research / Systems engineering / Reinforcement learning / Q-learning / Markov decision process / Bellman equation / Statistics / Systems theory / Control theory

State Abstraction for Programmable Reinforcement Learning Agents David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]fdandre,[removed]

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2008-01-03 13:48:15
250Systems theory / Equations / Mathematical optimization / SARSA / Q-learning / Markov processes / Stochastic control / Reinforcement learning / Markov decision process / Statistics / Control theory / Dynamic programming

Q-Decomposition for Reinforcement Learning Agents Stuart Russell @.. Andrew L. Zimdars @..

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2003-06-03 00:44:40
UPDATE